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Abstract. In a CNF formula, we say that a pair C, D of clauses consti- 
tutes a conflict if there is a variable that occurs positively in one clause 
and negatively in the other. We show that any a fc-CNF formula with 
less than O (2.69 fe ) conflicts is satisfiable. 



1 Introduction 

Our goal is to find sufficient criteria for satisfiability of Boolean for- 
mulas. To be meaningful, these criteria must be easy to check, should 
contain certain "high-level" information about the formula, such as size 
and number of its clauses, but abstract from the small details of the for- 
mula, like which literals exactly some clause contains. For example, any 
fc-CNF formula with less than 2 fe clauses is satisfiable. This criterion is 
nice because it forgets almost all information about the formula. 

In this paper, we give another sufficient criterion for satisfiability, in 
terms of the number of conflicts of a formula. Formally, a conflict of F 
is a pair {C, D} of clauses in F such that there exists a variable x with 
x G C and x £ D (or vice versa). The set of clauses D 6 F that con- 
flict with C is denoted by 7V(C), and c(F) denotes the total number of 
conflicts in F. It is clear that a formula without conflicts is satisfiable, 
as every variable occurs either only positively or only negatively. Fur- 
ther, intuitively a fc-CNF formula with "few" conflicts is satisfiable, too. 
Giving bounds on what "few" exactly means is the goal of this paper. For- 
mally, let Cfc := max{c | every fc-CNF with < c conflicts is satisfiable}. 
We want to determine the growth of Ch- 

This paper is meant as an informal follow-up to [1], however, we try to 
keep this paper self-contained. The best known bounds for the parameter 
Ch are from [1] 

'4 fc log3fc' 



«(2.32 fc ) < c fe < O 
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1.1 Results 



Our central technical tool is the famous Lovasz Local Lemma (actually 
a more general version of it, called the Lopsided Lovasz Local Lemma), 
see e.g. [2] . Usually stated in more abstract terms, we present a version 
that is already "compiled" into terms of CNF formulas. 

Lemma 1 ([1]). Let F be a CNF formula not containing the empty 
clause. Let it be a probability distribution under which all variables are 
independent. If for every C € F, it holds that 



then F is satisfiable. Here Pr,r [->!)] denotes the probability of D being 
unsatisfied under a random assignment sampled according to distribution 

IT. 

For a literal u, let occf(m) denote the number of clauses of F containing 
u, and occf(m) the number of those containing u. The following result 
follows immediately from Lemma 1, by taking it to be the uniform dis- 
tribution. 

Corollary 2 (basically given in [3]). If F is a k-CNF and \r F (C)\ < 
2 k ~ 2 for every clause C, then F is satisfiable. Thus, c k > 2 fc ~ 2 . 

Corollary 3. If F is a k-CNF, and occf(u) < for every literal u, 
then F is satisfiable. 

Proof. Note that \r F (C)\ < J2 u ec occ F (u) < fc|£. By Corollary 2, F is 
satisfiable. □ 

Observation. c(F) > occf(w) • occf(w) for every u. 

In every unsatisfiable fc-CNF formula with only "a few" conflicts there is 
a literal u which occurs "often" (by Corollary 3), and its negation occurs 
only "a few" times (by the above observation). It is natural to define the 
probabilities in such a way that it is more probable to set u to true. If 
we do this right, we will be able to prove 

Theorem 4. Every k-CNF formula F with less than 2< 128 ^' > conflicts 
is satisfiable, where ct^ is the smallest integer £ such that fl(x) < i for 
all x G [0, 1], and fk{x) = x y/1 — x. 

It is easy to see that this number at exists for every k. Furthermore, we 
can prove 

Theorem 5. 
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Corollary 6. 



Cfe > n(2 (2 -°- 572)k ) > n{2.69 k ). 



2 A Qualitative Idea of Our Approach 

In this section we give a very informal outline of our approach. We do 
not want to bother the reader with calculations in this section, hence we 
use terms like "few" , "many" , "small" etc. in a deliberately vague way. 

Imagine somebody gives you a fc-CNF formula F with "few" conflicts, 
say 2.2 fc , just to give some concrete number, and you want to prove that 
F is satisfiable. As a first step, you might check whether there is a literal 
u occurring in > |_ clauses. If not, F is satisfiable by Corollary 3, and 
we are done. If yes, consider some literal u with occf(«) > fqr. Recall the 
previous observation: c(F) > occf(u) • occf(w). In other words, occf(w) 
is small, in our case occf(w) < 4fc 2 2 fc ' 2 = 4fc ■ 1.1 . 

Recall that in Lemma 1, we may choose any probability distribution, 
as long as all variables are set independently. So if occf(u) is large and 
occf(u) is small, why not set u to true with probability > |? This 
sounds reasonable, because occf(m) many clauses "benefit" from this, 
while only occf(w) many clauses "suffer". For example, for a clause D 
containing u we now have Pr-^D] < 2~ fe . One might hope that using 
this non-uniform distribution, we can show that 



and thus, by Lemma 1, conduce that F is satisfiable. If one does the 
math, it turns out that in fact a fc-CNF formula F is satisfiable if it has 
less than O (t^-) conflicts. We conclue that c k > Q (t^-J- This is, 



To achieve this, we have to apply a trick. Instead of estimating the sum 
in (2) for F, we estimate it for a new formula F' . We obtain F' from F by 
deleting certain literals from certain clauses. Roughly speaking, if u £ D, 
D has only few conflicts, Pr^[^D] is not small enough and occf(w) is 
too large, we replace D by D \ {u}. If we do it right, (2) will hold for the 
resulting formula F'. Thus, by Lemma 1, F' is satisfiable, so F is, as well. 

Let us become a little bit philosophical and ask why deleting certain 
literals helps. Clearly, the resulting formula F' is harder to satisfy than F, 
meaning that is has fewer satisfying assingments. However, by removing 
certain literals from certain clauses, we are "thinning out" the conflict 
structure of F, to a point where we can apply the Lovasz Local Lemma. 
So to speak, there is less gold buried in F' than in F, but in F' it is 
closer to the surface. 

3 Proof of Theorem 4 

Let F be a fc-CNF and let c(F) be the number of conflicts of F. Through- 
out the proof, we will use the convention that x denotes a variable occur- 
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would like to prove a bound of 17 ( 7 fc ) for some 7 > 2. 



ring in F, i.e., a positive literal, whereas u denotes a positive or negative 
literal. Further, we can assume w. 1. o.g. that occ_f(:e) > occf{x), for 
every variable x, and that no variable is pure, i.e., whenever x occurs in 
F, then also x occurs in some clause. We define p(x) = Pr[x = 1] by 

, s ( 1 occf(x)\ 

Also, we set p(x) = 1 — p(x). Indeed, this is a probability distribution 
because c(F) > occf{x) for every variable x. We choose the truth value 
of each variable independently according to p(x). It holds that 



i \ 1 i / \ k occf(x) 1 

p(u)<-^« = xaiidp( a; )= ^ TH -A^>-, 

1 k occ F (x) 1 k occ F (x) 1 

^=2^ "2 ^ Vl^) -2" 

We distinguish two types of clauses: 6ad clauses, which contain at least 
one literal u with p(u) < | , and good clauses, which only contain literals 
u with p(tt) > |. We subject every good clause of F to the following 
deletion process: 



Algorithm:Deletion Process 

Let Q = {D £ F : > |,Vw € 73} 
foreach D £ Q do 

j = fc 

Z) = {Wl, M2, • • • , Ufe} with OCCf(ui) < . . . < OCCF(Wfe) 

while PrhD] > sk J F(uj) and D ± □ do 

D — D \ {uj} 

3=3-^ 
end 

end 



Note that the probabilities are not changed during the algorithm and 
also the occurences occ F (u) remains unchanged, i.e., according to the 
original formula F. 

Denote by F' the resulting formula of the deletion process. We have either 
□ G F' , i.e., some clause has lost all its literals in the deletion process, 
or Pr[^D] < (8fcmax ue i5 occF(tt)) -1 for every good clause in F'. The 
proof now proceeds in two steps. First, we show that if □ F', then F' 
(and thus F) is satisfiable. Secondly, we finish the proof by showing that 
no clause has been completely deleted, i.e., □ ^ F' . 

Lemma 7. Let F' be the result of the deletion process described above. 
//□ F', then F' and F are satisfiable. 



Proof. We apply the Lovasz Local Lemma with the probability distribu- 
tion described above. For each clause C G F' , we partition the neighbor- 
hood r F i (C) into B and Q, the set of bad and good clauses, respectively. 
Our goal is to show that 

E p *h D }<\ 

Der F ,(C) 

for each C € F' . Then it follows from Lemma 1 that F' is satisfiable. 
First, we estimate ^2 DeB Pr[-iD]. Since bad clauses are not subject to 
the deletion process, each bad clause still has k literals. A clause D G B 
contains at least one literal that is satisfied with probability < |, which 
must be a negative literal. Let id be the literal in D with the smallest 
probability of being satisfied. Then, 



Pr[-.D] < p(x D ) k = 



occf(^d) 



16c(F) 

by the definition of the probability distribution. Furthermore 
c(F) > \ Y, \ F ^°)\ ^ \ E occ F ( XD ) 



2 ' 2 

DeB DeB 



where we put the ^ because some conflicts might have been counted 
twice. Together this yields 



y PrhD] < y 2ss£^l < I 



In a second step, we will bound ~^2 DeS Pt[-iD], i.e., the sum over all 
good neighbors of C. First note that for each D G Q, there is at least one 
"responsible" literal u with u G D , u G C '. This literal can be responsible 
for at most occ F /(u) < max„ e ii occf(m) clauses D G Q. Also recall that 
Pr[^D] < (8fc maxugu occf(m)) -1 . We calculate 

y^¥x[-^D\ < y^ E 8fcmax ueD occf(m) ~ ^ 8kocc F >(u) ~ 8 ' 
Dee aec Dee v ; uec DeQ K ' 

We conclude that X^Der F /(C) P r [^^] < 41 an d therefore, by Lemma 1, 
F' is satisfiable. Thus F is satisfiable, which proves the lemma. □ 

Lemma 8. If c(F) < 2<2 12 °^ ) '" i/ien the deletion process described above 
does not create an empty clause. 

Proof. Suppose, for the sake of contradiction, that it does create an 
empty clause. Let C = {ui,U2, ■ ■ ■ ,Uk} be a good clause of F that be- 
comes the empty clause during the deletion process. Order the m such 
that occf(wi) < ••• < occi?(ufc). Since the deletion process deletes all 
literals m from C, the inequality 



holds for every 1 < I < k. Write pi := p(ui) and note that pi > 
\J °i6e(F) "^ (either Ui = x, or Ui = x and p(x) = p{x) = |). Inequal- 
ity (3) now becomes 

u^-^^usdm (4) 

Note that C does not contain bad literals, thus every literal is set to true 
with probability at least | and therefore all pi G [|, 1]. We will argue 
that such numbers pi, . . . ,pk cannot exist for the our values of c(F). 

Claim. If there are values p\ , . . . , pk satisfying inequalities (4) for all 
1 < £ < k, then there are values qi, . . . , qu such that for each £ 



- l28kq$c(F) ' 

and for all qi > \, this inequality is satisfied with equality. 

Proof. Just start with qi and try to make it as small as possible, but not 
smaller than i. Note that q\ being small makes it only easier to satisfy 
inequalities 2, . . . , k. Then do the same with p2, P3 and so on. Formally, 
for I = 1, 2, . . . , k, define 









{.. 







By induction, qi < pi for all i < £, and the set above is non-empty, 
because it contains pi, and it is closed. Therefore, qe is well-defined, 
and qt < pe. Clearly, if qi > |, then the £ th inequality is satisfied with 
equality. □ 

Let (qi, . . . , qu) be as in the claim. If all qt's are equal to | then the k th 
inequality reads as ^ > 12Sk2 -k c(F) , and thus c(F) > , which 

contradicts our assumption. Therefore we can assume that the minimum 
I = min{i | g» > 5} exists. We obtain for I < t < k that 

128fcg* +1 c(F) = (1 " qt+l) " 9l) = l28fe?MF) ' 



and solving for q t , we obtain q t = fk(lt+i) f° r fk{x) := x tyl — x. Con- 
sequently, we obtain fk~ e (qk) = qe > \ (note here that this is also for 
£ — k true). By definition of a*,, this means that k — I < ak — 1. Now 
focus on the I — 1 st inequality: 



128fc^_ 1 c(F) 



Using the fact that q 1 = q 2 = ■ ■ ■ = = |, by the definition of £, and 
that k — £ < ak — 1, we obtain 



128fcc(F) 



Re-arranging terms, we obtain c(F) > 2 ^ ° fc fc , clearly a contradiction to 
our assumption. Therefore the deletion process cannot produce an empty 
clause. □ 

Lemma 7 and 8 together imply Theorem 4. 



4 Proof of Theorem 5 



As promised, we want to estimate the values of a k , or rather, compute 
limfe^oo (and of course show that the limit exists). Recall the defini- 
tion Q fe = min{£ | f£(t) < \ Vt G [0, 1]}, where f k (t) = tfyT^t. Since 
/fe has a unique maximum on [0, 1] which is attained at t* := and 
is monotonically increasing for all t < t* and satisfies fk(t) < t for all 
t G [0, 1], we get fl(t) < flit*) for all t G [0, 1]. We generalize the def- 
inition of a k by defining S k (ti,t2) := min{£ 

I fk(ti) < *2>. Obviously, 
Qfe = S k (t*, |). Note that S k (ti,t 2 ) is finite for t 2 > 0, and S k (t,t) = 0. 

Observation: For < t 2 < ti < 1, we have 

S*(ti,t2)-l<S*(t*,ta) ti)<S*(ti,t2) • (5) 

This follows from basic observations about iterated application of a func- 
tion /fe with fk(x) < x. If you will, you can view S k as a metric on (0, 1]. 
The second inequality is thus simply the triangle inequality, and the first 
inequality states that triangles in this metric are "very flat" . 



Lemma 9. Let S k (ti,t 2 ) be defined as above, and s k (t) := ^S k (t*,t). 
Then s k (t) converges for every t G (0, 1] to 

f 1 1 

s(t) := hm Sfe(i) = - / — — -da; 

k^oo J t xln(l — x) 



Proposition 10. For < t 2 < ti < 1 

\nt 2 - lnti ]nt 2 - lnti , . 

^(i-to - Sk(tl ' t2) - k Hi-t 2 ) + 1 (6) 

Proof. We first prove the first inequality. Note that t 2 > f k k {ti). 
We compute for i > 1 

because fl(ti) < t\. Iterating i times, we obtain f k (ti) > ti(l — ti)T* . 
Plugging this bound into t 2 > f Sk(tl ' t2) (ti) and solving for S k (ti,t 2 ) 
proves the first inequality. 

For the second inequality, note that f k (ti) > t 2 for all £ < S k (ti,t 2 ) — 1. 
Therefore f k (ti) = f~\U) ^/l^fij < ft^ti) VT=T 2 . By iter- 

ating this argument, we obtain t 2 < f^^ 1 ^ 2 ^ 1 (ti) < ti(l— t 2 )^ £ 
Solving for S k (ti,t 2 ), we obtain the claimed upper bound. □ 



Combining (5) and (6) we obtain 



lnfa-lnfa 1 lnfa-lnfa 1 

In(l-fa) - * " Skit2) ~ Skitl) ~ HI fa) + jfc • (?) 

Note that althouhg we derived these inequalities for < fa < fa < 1, it 
is easy to see that they also hold for < fa < fa < 1. The idea of the 
proof is the following: We compute the derivative of Sk (never mind for 
the moment that Sk is not even continuous) by computing s fcfa)-«fc(«i) 
for fa — > fa. Imagine that k = oo, then the upper and lower bound of 
(7), devided by fa — fa, converge to the same function g(t). We can now 
compute Sk(t) by integrating g(t). The problem of course is that Sk(t) is 
a step function, thus not continuous, and we do not know (yet) whether 
lmifc-^oo Sk(t) exists. We need the following lemma. 

Lemma 11. Let s k (t), k € N, t € (0,1] be a sequence of functions with 
•Sfc(l) = for all k. Assume that for all < fa < fa < 1, we have 

A(t u t 2 ) - -j- < Sfe (fa) - sfc(ti) < B(fa,fa) + i (8) 

w/iere ^4, B : (0, 1] x (0, 1] — » R are continuous, A(t,t) = B(t,t) = for 
all t, and lim/ t _,o A ( t '* +fe ) — limh^o B( - t '^ +h) = g(t). T/ien 



lim Sfc(t) =: s(t) = — g(x)da 



fc— >o 

In particular, the sequence s k (t) converges to a continuous function. 

Before we prove Lemma 11, let us show how to prove Lemma 9. We apply 
Lemma 11 with 

_ In fa - In ti _ In fa - In fa 

A{tlM) m(l-fa) B(tl '' 2) - In(l-fa) 

Note that these functions satisfy the requirements of Lemma 11. In par- 
ticular, 

lim A(t,t + h) = lim B(t,t + h)_ 1 



h^o h h^o h t ln(l — t) 

Now, Lemma 9 follows from Lemma 11. 



Proof (of Lemma 11). First plug in fa = 1 into (8) to observe that 
Sk{t2) is bounded from below and above for all < fa < 1. Therefore, 
s(t) := limsupsfe(f) exists, and similarly s(t) := liminf Sk(t). Note that 
we do not even know at this point whether s k {t) converges, therefore we 
can only use limes superior and limes inferior. 

Proposition 12. : Let a k , bk be two sequences of real numbers which are 
bounded from above and below. If there are L, U £ R such that L — e k < 
cifc — bk < U + Ek, where Ek — » as k — » oo, then 

L < lim sup a k — lim sup b k < U 
L < lim inf — lim inf bk < U 



Proof. First note that in general lim sup — limsupfefc 7^ lim sup (a^ — 
&fc). We will proof that limsupafc — limsupfefc < U + e, for any e > 0. 
This will prove one claimed inequality. The other three inequalities can 
be proven similarly. So let e > be given. For all sufficiently large k, 
£fe < § . By definition of lim sup, we have au > lim sup at — § for infinitely 
many k. Hence we also have bk > au — U — > lim sup at — | — U — | 
for infinitely many k. Therefore limsup&fc > limsupa fc — U — e, and the 
inequality follows. □ 



We apply Proposition 12 with at = Sk{t + h), bk = Sk(t), L = A(t,t + h), 
U = B(t,t + h) and e k = ±: 



We divide both lines by h and let h go to 0. We obtain s'(t) = s'(t) = g(t). 
Note that this implies as well that s(t) and s(t) are continuous. Since 
s(l) = s(f) = by assumption on Sfc(t), we have 



Hence the Sk{t) converge to a continuous function s(t), as k grows. This 
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A(t, t + h) <s(t + h) - s(t) < B(t, t + h) 
A(t, t + h) <s(t + h)- s(t) < B(t, t + h) 




proves the lemma. 



□ 



